Learning from missing data with the binary latent block model

نویسندگان

چکیده

Missing data can be informative. Ignoring this information lead to misleading conclusions when the model does not allow extracted from missing data. We propose a co-clustering model, based on binary Latent Block Model, that aims take advantage of nonignorable nonresponses, also known as Not At Random A variational expectation–maximization algorithm is derived perform inference and selection criterion presented. assess proposed approach simulation study, before using our voting records lower house French Parliament, where analysis brings out relevant groups MPs texts, together with sensible interpretation behavior non-voters.

برای دانلود رایگان متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Learning from Data Sets with Missing Labels

This paper consider the task of learning discriminative classifiers of data when some class labels are missing from the data set(so-called “semi-supervised” learning), specifically when the labeled data are not drawn from the same distribution as the unlabeled data. This is an important issue in domains in which learning from only the labeled samples can result in a classifier that is not appro...

متن کامل

Modeling Dyadic Data with Binary Latent Factors

We introduce binary matrix factorization, a novel model for unsupervised matrix decomposition. The decomposition is learned by fitting a non-parametric Bayesian probabilistic model with binary latent variables to a matrix of dyadic data. Unlike bi-clustering models, which assign each row or column to a single cluster based on a categorical hidden feature, our binary feature model reflects the p...

متن کامل

A latent class selection model for nonignorably missing data

A Latent-Class Selection Model for Nonignorably Missing Data Most missing-data procedures assume that the missing values are ignorably missing or missing at random (MAR), which means that the probabilities of response do not depend on unseen quantities. Although this assumption is convenient, it is sometimes questionable. For example, questionnaire items pertaining to sensitive information (e.g...

متن کامل

Multiple imputation for model checking: completed-data plots with missing and latent data.

In problems with missing or latent data, a standard approach is to first impute the unobserved data, then perform all statistical analyses on the completed dataset--corresponding to the observed data and imputed unobserved data--using standard procedures for complete-data inference. Here, we extend this approach to model checking by demonstrating the advantages of the use of completed-data mode...

متن کامل

The Limits of Learning with Missing Data

We study linear regression and classification in a setting where the learning algorithm is allowed to access only a limited number of attributes per example, known as the limited attribute observation model. In this well-studied model, we provide the first lower bounds giving a limit on the precision attainable by any algorithm for several variants of regression, notably linear regression with ...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

ژورنال

عنوان ژورنال: Statistics and Computing

سال: 2021

ISSN: ['0960-3174', '1573-1375']

DOI: https://doi.org/10.1007/s11222-021-10058-y